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Field 

[0001] The present invention relates to a digital circuit, and more particularly to a 

static receiver having a variable inversion threshold. 

Background 

[0002] In computer systems, it is required to drive relatively large on-chip loads, 

such as for example, a long interconnect. On-chip interconnects, such as, for example, 
buses and bitlines, are found in virtually all components making up a computer system. 
Consider the computer system illustrated in Fig. 1. In Fig. 1, die 102 comprises a 
microprocessor with many sub-blocks, such as arithmetic logic unit (ALU) 104 and on- 
die cache 106. Die 102 may also communicate to other levels of cache, such as off-die 
cache 108. Higher memory hierarchy levels, such as system memory 110, are accessed 
via host bus 112 and chipset 114. In addition, other functional units not on die 102, such 
as graphics accelerator 116 and network interface controller (NIC) 118, to name just a 
few, may communicate with die 102 via appropriate buses or ports. Each of these 
functional units may physically reside on one die or more than one dice. Some or parts of 
more than one functional unit may reside on the same die. Each of these various 
described components requires driving large loads, such as long interconnects. As clock 
frequencies for the various chips and buses in a computer system increase, high-speed 
signaling by heavily loaded drivers presents challenges. 

[0003] Full-swing, on-chip signal signaling schemes may generally be considered 

as either dynamic or static. A dynamic signaling scheme may be abstracted in Fig. 2a, 
where a dynamic driver comprising pFET (Field-Effect-Transistor) 202, nFET 204, and 
nFET 206 either charges or discharges interconnect 208 to HIGH (Vcc) or LOW (Vss). 
Interconnect 208 may be a long interconnect or a large load, for example. A clock signal 
is provided to the gates of pFET 202 and nFET 206. Data is provided to the gate of nFET 
204. During a pre-charge phase, the clock signal is LOW so that pFET 202 charges 
interconnect (load) 208 HIGH. During an evaluation phase, the clock signal is HIGH so 
that pFET 202 switches OFF, and interconnect 208 is conditionally discharged LOW 
depending upon the data signal provided to the gate of nFET 204. 
[0004] Although dynamic signaling schemes are usually relatively fast, they may 

consume a considerable amount of power even when the data activity is zero (e.g., when 
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the data signal is HIGH over a number of clock cycles). This is due to the power 
dissipated in clocking the pre-charge devices, as well as the unnecessary full-swing 
transitions on the interconnects. 

[0005] In static schemes, drivers, repeaters, and receivers are typically simple 

CMOS static gates, such as inverters. A static scheme may be abstracted in Fig. 2b, 
where driver 210 and receiver 213 are static inverters. Interconnect 214 may be a large 
interconnect or large load, for example. Unlike a dynamic scheme, there is no significant 
power consumption for zero data activity. Furthermore, there is no clocked device that 
leads to clock power dissipation. However, in contrast to a dynamic scheme, both rising 
and falling signals received at the input of receiver 212 should be evaluated equally fast. 
Consequently, it has been desirable in static schemes that receivers should be 
symmetrical with an inversion threshold in the middle of the power rails. That is, 
denoting the HIGH and LOW power rail voltages as Vcc and Vss, respectively, the 
inversion threshold has historically been set at (Vcc-Vss)/2. But for signals with slow 
edge rates, there may be an undesirable delay before a signal reaches the inversion 
threshold for a symmetrical receiver. This may result in considerable delay and degrade 
the performance of a bus. 

[0006] A fast, asymmetrical static receiver has been proposed in Tomofumi lima, 

et al., "Capacitance Coupling Inmiune, Transient Sensitive Accelerator for Resistive 
Interconnect Signals of Subquarter Micron ULSI," IEEE Joumal of Solid State Circuits, 
Vol. 31., no. 4, April 1996, pp 531-536, and is shown in Fig. 2c. However, there are some 
disadvantages to the circuit of Fig. 2c. This will be addressed after describing the 
proposed embodiments. 

Brief Description of the Drawings 
[0007] Fig. 1 illustrates a computer system. 

[0008] Figs. 2a, 2b, and 2c illustrate prior art signaling schemes. 

[0009] Fig. 3 is a receiver according to an embodiment of the present invention. 

[0010] Figs. 4a and 4b illustrate effective circuits for the receiver of Fig. 3 as seen 

by a falling signal. 

[0011] Fig. 5 illustrates an effective circuit for the receiver of Fig. 3 as seen by a 

rising signal. 
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[0012] Fig. 6 is a receiver according to another embodiment of the present 

invention. 

[0013] Fig. 7 is a modification to the receiver of Fig. 3 according to an 

embodiment of the present invention. 

[0014] Fig. 8 is a modification to the receiver of Fig. 6 according to an 

embodiment of the present invention. 

Description of Embodiments 
[0015] Fig. 3 illustrates a static receiver 302 according to an embodiment of the 

present invention. The input port of receiver 302 may be taken at node 304, which 
receives signals transmitted by static driver 306 over interconnect 308. The output port of 
receiver 302 is indicated by numeral 310. Static inverters 312 and 314 are symmetrical 
inverters, and may be realized by simple CMOS inverters. Whether or not a simple 
CMOS inverter is used to reaUze inverter 314, when describing the operation of receiver 
302, it is convenient to consider inverter 314 as a simple CMOS inverter as shown within 
the dashed Une labeled as 314' and to denote the device transconductance of pFET 328 as 
Pp and the device transconductance of nFET 330 as Pn. For synmietrical operation, pp 
and Pn should be substantially equal to each other. Also shown in Fig. 3 are power rails 
316 and 318 with voltages Vcc (HIGH) and Vss (LOW), respectively. Although not 
shown in Fig. 3, some embodiments may employ a non-inverting delay element in- 
between the output port of inverter 312 and the gates of transistors 320 and 322. 
[0016] Operation of receiver 302 is easily described as follows. Suppose a rising 

voltage has aheady been received at input port (node) 304 so that input port 304 is at 
Vcc. Inverter 312 provides voltage Vss to the gates of pFET 320 and nFET 322, so that 
pFET 320 is ON and nFET 322 is OFF. With nFET 322 OFF, nFET 326 is effectively 
isolated from power rail 318. With pFET 320 ON, a low impedance path is provided 
from the source of pFET 324 to power rail 316. Because a static scheme is employed, 
under normal operation node 304 will stay at Vcc until there is a change in the 
transmitted data, i.e., when a falling signal is received at input port 304. For purposes of 
considering now a falling signal received at input port 304, the effective circuit seen 
looking into input port 304 of receiver 302 may be taken as that shown in Fig. 4a. 
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[0017] Referring to Fig. 4a and to the representation 314' in Fig. 3 of inverter 

314, the input signal at input port 304 effectively sees the simple CMOS inverter of Fig. 
4b with an effective nFET device transconductance Pn and an effective pFET device 
transconductance (Pp + pp'), where pp' is the effective device transconductance of the 
series combination of pFETs 324 and 320. Because pp and pn are substantially equal to 
each other, it follows that pn < (pp + pp'). Consequently, the inversion threshold of 
receiver 302 is effectively raised so that receiver 302 is skewed in favor of the falling 
signal received at node 304. For a given size for inverter 314, the amount of skew is 
controlled by the size of pFETs 324 and 320. By sizing these pFETs appropriately, a 
significant skew may be realized. 

[0018] Now consider another change in the transmitted data so that a rising signal 

is received at input port 304. Then, by a similar argument as described above, the input 
signal at input port 304 effectively sees the simple CMOS inverter of Fig. 5 with an 
effective nFET transconductance (pn + Pn'), where pn' is the effective nFET . 
transconductance of the series combination of nFETs 326 and 322, and with an effective 
pFET transconductance of pp. It then follows that pp < (Pn + pn'), so that now receiver 
302 is skewed in favor of the rising signal received at node 304. By sizing nFETs 326 and 
322 appropriately, a significant skew may be realized. 

[0019] From the above discussion, it is observed that the ratio of effective pFET 

device transconductance to effective nFET device transconductance as seen by a received 
signal is made to depend upon a previously received signal when a data change occurs. 
As a result, a falling received signal sees a ratio (Pp + pp')/ pn > 1, and a rising received 
signal sees a ratio Pp/(pn + Pn') < 1 . Consequently, receiver input-to-output transitions 
may be increased by appropriately sizing nFETs 322 and 326, and pFETs 320 and 324, 
with a resulting decrease in signal transmission delay. 

[0020] Fig. 6 illustrates a static receiver 602 according to another embodiment of 

the present invention. Suppose input port (node) 604 is already at Vcc. Then pFET 606 is 
OFF and transmission gate 608 is ON, so that there is a low impedance path coupling the 
gate of pFET 610 to input port 604. Also, nFET 612 is ON and transmission gate 614 is 
OFF, so that nFET 616 is held OFF and its gate is isolated fi-om input port 604. The 
effective circuit looking into input port 604 is that of Fig. 4b, where now pp' is the device 
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transconductance of pFET 610. Like the receiver of Fig. 3, the receiver of Fig. 6 is 
skewed in favor of a falling signal received at input port 604. Similarly, if input node 604 
is already at Vss, the effective circuit looking into input port 604 is that of Fig. 5, where 
now pn' is the device transconductance of nFET 616, and the receiver of Fig. 6 is skewed 
in favor of a rising received signal at input port 604. Consequently, the receiver input-to- 
output transition of Fig. 6, like that of Fig. 3, may be increased by appropriately sizing 
nFET 616 and pFET 610, with a resulting decrease in signal transmission delay. 
However, the embodiment of Fig. 3 results in a simpler layout, with less routing and 
fewer transistors. 

[0021] For both embodiments of Figs. 3 and 6, as discussed earlier with respect to 

Fig. 3, the ratio of effective device transconductances as seen by a received signal is 
made to depend upon a previously received signal when a data change occurs. As a 
result, this variable nature of the ratio of effective device transconductances may 
introduce output ghtches if the edge rate of the received signal is too slow. This may be 
mitigated by modifying the embodiments of Figs. 3 and 6, as shown in Figs. 7 and 8, 
respectively. 

[0022] The operation of the receiver of Fig. 7 is not unlike that of Fig. 3. 

However, in Fig. 7, the combination of symmetrical inverter 705, transistors 702 and 704, 
and transistors 706 and 708 with their gates connected to the output port of inverter 710, 
results in a asymmetrical inverter with a raised inversion threshold for a rising signal at 
node 712 and a lowered inversion threshold for a falling signal at node 712. In contrast, 
inverter 312 of Fig. 3 is a symmetrical inverter with fixed inversion threshold. As a result, 
the variable inversion threshold for the asymmetrical inverter comprising symmetrical 
inverter 705, and transistors 702, 704, 706, and 708, may be designed to ensure that a 
received signal at input node 712 has made a sufficiently complete transition before the 
receiver of Fig. 7 is "reconfigured" in favor of the next input signal transition. 
[0023] Similarly, the operation of the receiver of Fig. 8 is not imlike that of Fig. 

6, except that resulting asymmetrical inverter comprising symmetrical inverter 805, and 
transistors 802, 804, 806, and 808, has a raised inversion threshold for a rising signal at 
node 812 and a lowered inversion threshold for a falling signal at node 812. As for the 
receiver of Fig. 7, this ensures that a received signal at input node 812 has made a 
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sufficiently complete transition before the receiver of Fig. 8 is reconfigured in favor of 
the next input signal transition. 

[0024] Referring now to Fig. 2c, its operation should be clear in Ught of the above 

description of the disclosed embodiments, where it is noted that element 224 in Fig. 2c is 
a non-inverting delay element. There are some disadvantages to the receiver of Fig. 2c 
when compared to the disclosed embodiments. The voltages at nodes 216 and 218 may 
be vulnerable to different sources of coupling noises. As one example, consider the case 
in which a HIGH signal is received at node 220, and where node 216 was previously 
charged to Vcc due to a previously received LOW signal at node 220. After a short time 
delay, when the received HIGH signal has had time to propagate via delay element 224, 
the gate terminal of nFET 222 makes a LOW to HIGH transition. This LOW to HIGH 
transition may couple into node 216 by way of the gate-to-channel capacitance of nFET 
222, causing a voltage overshoot at node 216. Now, suppose in the next signaling time a 
LOW signal is received, so that node 220 now makes a HIGH to LOW transition. Before 
this newly received signal has had time to propagate through delay element 224 to set up 
the circuit in a new configuration, node 216 is supposed to be pulled LOW via nFET 222. 
But, the voltage overshoot on node 216 adds a delay to pulling down node 216, thereby 
possibly degrading performance. But perhaps more importantly, the voltage overshoot 
above Vcc at node 216 stresses the gate oxide of pFET 228, and may eventually damage 
pFET 228. A similar discussion applies to node 218. In contrast, the receiver of Fig. 6 (as 
well as Fig. 8) does not have this problem of developing voltage overshoots at nodes 618 
and 620 because the voltage transitions on the two gate terminals of a transmission gate 
(e.g., transmission gates 608 or 614) are complementary to each other. 
[0025] As another example, it is noted that in VLSI chips, such as those used in 

the various components of the computer system illustrated in Fig. 1, there may be many 
independent signals propagating on neighboring interconnects on the same or 
neighboring interconnect layers within a VLSI chip. As a consequence, there is often a 
significant likelihood of noise coupling from one interconnect to another, and therefore 
robustness against such coupling noises is desirable. However, under some circumstances 
the receiver of Fig. 2c does not exhibit such robustness. For example, suppose node 216 
has been charged to Vcc because a LOW signal was received at node 220. Suppose now 
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that a HIGH signal is received at node 220, so that node 230 is now HIGH. After 
propagation of the received signal through delay element 224, pFET 226 is switched OFF 
and nFET 222 is switched ON. Until another signal transition is received, nFET 222 is 
supposed to keep node 216 at Vcc. But, nFET 222 cannot charge node 216 higher than 
Vcc-Vt, where Vt is the threshold voltage of nFET 222. Thus, if coupling noise were to 
drop the voltage at node 216 to Vcc-Vt, there is no mechanism to recover the voltage 
back to Vcc. Consequently, pFET 228 will start to conduct, causing considerable 
contention with puUdowns in inverter 232. This wastes power, and may result in 
degrading the speed performance. In contrast, the receiver of Fig. 6 (as well as Fig. 8) 
does not have this problem because the transmission gate keeps the node at Vcc. Note 
that the topology of Fig. 3 (as well as Fig. 7) is very different from that of Fig. 2c, and it 
does not suffer from the problems discussed above. 

[0026] Various modifications may be made to the disclosed embodiments within 

the scope of the invention as claimed below. In the claims below, it is to be understood 
that the meaning of ">4 is connected to 5" is that A and B are connected by a passive 
structure for making a direct electrical connection so that the voltage potential of A and B 
are substantially equal to each other. For example, A and B may be connected by way of 
an interconnect, transmission line, etc. In integrated circuit technology, the "interconnect" 
may be exceedingly short, comparable to the device dimension itself For example, the 
gates of two transistors may be connected to each other by polysilicon or copper 
interconnect that is comparable to the gate length of the transistors. 
[0027] It is also to be understood that the meaning of is coupled to 5" is that 

either A and B are connected to each other as described above, or that, although A and B 
may not be connected to each other as described above, there is nevertheless a device or 
circuit that is connected to both A and B. This device or circuit may include active or 
passive circuit elements. For example, A may be connected to a circuit element which in 
tum is connected to 5. 
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